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1. Introduction 



1.1. Infinity-Laplacian and games 

For an integer m > 2 let a bounded C 2 domain G C M m , and functions g G C(dG, M) and 
h G C(G,IR\{0}) be given. We study a two-player zero-sum stochastic differential game (SDG), 
defined in terms of an m-dimensional state process that is driven by a one-dimensional Brow- 
nian motion, played until the state exits the domain. The functions g and h serve as terminal, 
and, respectively, running payoffs. The players' controls enter in a diffusion coefficient and in 
an unbounded drift coefficient of the state process. The dynamics are degenerate in that it is 
possible for the players to completely switch off the Brownian motion. We show that the game 
has value, and characterize the value function as the unique viscosity solution u (uniqueness 
of solutions is known from [9] ) of the equation 

-2Z\ooU = h in G, 

(1.1) 

= g on dG. 

Here is the infinity-Laplacian defined as A^f = (Df)'(D 2 f)(Df)/\Df\ 2 , provided Df ^ 
0, where for a C 2 function / we denote by Df the gradient and by D 2 f the Hessian matrix. 
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Our work is motivated by a representation for u of Peres et. al. 0] (established in fact in a 
far greater generality), as the limit, as e — * 0, of the value function V s of a discrete time 
random turn game, referred to as Tug-of-War, in which e is a parameter. The contribution of 
the current work is the identification of a game for which the value function is precisely equal 
to u. 

The infinity-Laplacian was first considered by Aronsson []J in the study of absolutely minimal 
(AM) extensions of Lipschitz functions. Given a Lipschitz function u defined on the boundary 
dG of a domain G, a Lipschitz function u extending u to G is called an AM extension of u 
if, for every open U C G, Lipjju = Lipg^n, where for a real function / defined on F C M m , 
Lip F f = svp x y£p x ^y \f(x) — f(y)\/\x — y\. It was shown in [1] that a Lipschitz function u on G 
that is C 2 on G is an AM extension of u\qg if an d only if u is infinity-harmonic, namely satisfies 
AoqU = in G. This connection enables in some cases to prove uniqueness of AM extensions via 
PDE tools. However, due to the degeneracy of this elliptic equation, classical PDE approach 
in general is not applicable. Jensen |7[ showed that an appropriate framework is through the 
theory of viscosity solutions, by establishing existence and uniqueness of viscosity solutions to 
the homogeneous version (h = 0) of (jl.ljl . and showing that if g is Lipschitz then the solution 
is an AM extension of g. In addition to the relation to AM extensions, the infinity-Laplacian 
arises in a variety of other situations [3(] . Some examples include models for sand-pile evolution 
0], motion by mean curvature, and stochastic target problems 0], 10]. 



We do not treat the homogenous equation for reasons mentioned later in this section. The 
inhomogeneous equation may admit multiple solutions when h assumes both signs Our 
assumption on h implies that either h > or h < 0. Uniqueness for the case where these strict 
inequalities are replaced with weak inequalities is unknown 0] . Thus the assumption we make 
on h is the minimal one under which uniqueness is known to hold in general (except the case 
h = 0). 

Let us describe the Tug-of-War game introduced in [9|. Fix e > 0. Let a token be placed at 
x £ G, and set Xq = x. At the A;-th step of the game (k > 1), an independent toss of a fair coin 
determines which player takes the turn. The selected player is allowed to move the token from 
its current position X^-i G G to a new position Xk in G, in such a way that \Xk — A^_i| < e 
(0] requires \X^ — < e but this is an equivalent formulation in the setting described 

here). The game ends at the first time K when Xk G dG. The associated payoff is given by 



E 



2 K-i 

g(X K ) + KX k ) 

k=0 



;i.2) 



Player I attempts to maximize the payoff and Player IPs goal is to minimize it. It is shown 
in [9(] that the value of the game, defined in a standard way and denoted V £ (x), exists, that 
V £ converges uniformly to a function V referred to as the "continuum value function", and 
that V is the unique viscosity solution of (jl.ip (these results are in fact also proved for the 
homogeneous case, and in generality greater than the scope of the current paper). The question 
of associating a game directly with the continuum value was posed and some basic technical 
challenges associated with it were discussed in [9(. 
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Our approach to the question above is via a SDG formulation. To motivate the form of 
the SDG we start with the Tug-of-War game and present some formal calculations (a precise 
definition of the SDG will appear later). Let k E N} be a sequence of i.i.d. random variables 
on some probability space {Q,!F,P) with P(£& = 1) = P(S,k = —1) = 1/2, interpreted as the 
sequence of coin tosses. Let {Tk}k>o be a filtration of T to which {^} is adapted and such 
that {£fc+i ; 6c+2 • • •} is independent of Tk for every k > 0. Let {a/c}, {bk} be {J^j-predictable 
sequences of random variables with values in B e (0) = {x £ M m : |x| < e}. These sequences 
correspond to control actions of Player I and II; that is, a/% [resp., bk] is the displacement 
exercised by Player I [resp., Player II] if it wins the fc-th coin toss. Associating the event 
= 1} with Player I winning the fc-th toss, one can write the following representation for 
the position of the token, starting from initial state x. For j G N, 



x j = x + 2^ [ ak ^r~ + k ^?~\ = ^ — 2 — ^ k ^ 

k=\ k=l k=l 



We shall refer to {Xj} as the 'state process'. This representation, in which turns are not taken 
at random but both players select an action at each step, and the noise enters in the dynamics, 
is more convenient for the development that follows. Let e = l/y/n and rescale the control 
processes by defining, for t > 0, A™ = \/na[ nt ], Bf = \/nb[ nt ]. Consider the continuous time 
state process Xf = X\ nt \, and define {Wp}t>o by setting Wff = and using the relation 



k- 1\ ^ ,k-\ k 



k e N. 



W t n = W$_ 1)/n + (t ) yfrtk, t G 

Then we have 

Xt=x + \ f(A n s - B?)dW s n + \f ^l(A n s + B")ds. (1.3) 
2 Jo 2 Jo 

Note that W n converges weakly to a standard Brownian motion, and since \Af\ V \Bf\ < 1, the 
second term on the right hand side of (jl.3p forms a tight sequence. Thus it is easy to guess a 
substitute for it in the continuous game. Interpretation of the asymptotics of the third term is 
more subtle, and is a key element of the formulation. One possible approach is to replace the 
factor yfn by a large quantity that is dynamically controlled by the two players. This point of 
view motivates one to consider the identity (Proposition 15. 1| ) 



-2A oa f= sup inf Lha-b)'(D 2 f)(a-b)-(c + d)(a + b)-Df\, 

|6|=i,d>o |a|=M>o I 2 J ( L4 ) 

feC 2 , Df^O, 

for the following reason. Let H = S m ~ l x [0, oo) where S m ~ l is the unit sphere in W 71 . The 
expression in curly brackets is equal to C a ' b,c,d f (x) , where for (a, c), (b,d) G H, C a,b,c,d is the 
controlled generator associated with the process 

X t = x + (\a s - B s )dW s + [\c s + D S )(A S + B s )ds, t G [0, oo), (1.5) 

JO JO 

and (A, C) and (B, D) are control processes taking values in TL. Since Z\oo is related to fjl .3j) via 
the Tug-of-War, and C a,b)C ' d to (11. 5j) . identity (|1.4p suggests to regard (|1.5j) as a formal limit of 
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(|1.3p . Consequently the SDG will have (jl.5p as a state process, where the controls (A, C) and 
(B,D) are chosen by the two playes. Finally, the payoff functional, as a formal limit of (|1.2ft . 
and accounting for the extra factor of 1/2 in (jl.3p . will be given by Sf/J' h(X s )ds + g(X T )], 
where r = inf{t : ^ G} (with an appropriate convention regarding r = oo). 

A precise formulation of this game is given in Section 11.21 along with a statement of the 
main result. Section [1.31 discusses the technique and some open problems. 

Throughout, we will denote by §(m) the space of symmetric m x m matrices, and by I m G 
S(m) the identity matrix. A function •& : [0, oo) — * [0, oo) will be said to be a modulus if it is 
continuous, nondecreasing, and satisfies i?(0) = 0. 



1.2. SDG formulation and main result 



Recall that G is a bounded C 2 domain in R m , and that g : dG -> R and /i : G -> R \ {0} 
are given continuous functions. In particular we have that either /i > or h < 0. Since the 
two cases are similar, we will only consider h > 0, and use the notation h := inf^/i > 0. 
Let (17, {Tt}, P) be a complete filtered probability space with right-continuous filtration, 
supporting an (m + l)-dimensional {J^j-Brownian motion W = (W, W), where W and W are 
1- and m-dimensional Brownian motions, respectively (in Section [1.3l we comment on the need 
for the auxiliary Brownian motion W). Let E denote expectation with respect to P. Let Xt 
be a process taking values in R m , given by 

X t = x + [\a s - B s )dW s + f\c s + D S )(A S + B s )ds, t e [0, oo), (1.6) 

where x G G, At and Bt take values in the unit sphere 5 m_1 C R m , and Ct and Dt take values 
in [0, oo). Denote 

Y° = (A,C), Z° = (B,D). (1.7) 

The processes Y° and Z° take values in 7i = S m ~ l x [0, oo). These processes will correspond 
to control actions of the maximizing and minimizing player, respectively. Let 

r = inf{t : X t G dG}. 

Throughout we will follow the convention that the infimum over an empty set is oo. We write 

X(x,Y°,Z°) [resp., t(x,Y°,Z )} (1.8) 

for the process X [resp., the random time r] when it is important to specify the explicit 
dependence on (x,Y , Z°). If r < oo a.s., then the payoff J(x, Y°,Z°) is well defined with 
values in (—00,00], where 



J(x,F u ,Z u ) = E 



h(X s )ds + g(X T 
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and X is given by ([TBjl . When P(t(x,Y°,Z ) = oo) > 0, we set J(x,y°,Z°) = oo, in 
agreement with the expectation of the first term in (|1.9p . 

We turn to the precise definition of the SDG. For a process H = (A, C) taking values in TL 
we let S(H°) = ess sup sup te [ 0oo ) Ct- In the formulation below, each player initially declares a 
bound S, and then plays so as to keep S(H°) < S. 

Definition 1.1. (i) A pair H = ({H®}, S), where S£N and {H®} is a process taking values in 
TL, is said to be an admissible control if{H®} is {Ft} -progressively measurable, and S(H°) < S. 
The set of all admissible controls is denoted by M. For H = ({H®}, S) G M , denote S(H) = S. 
(ii) A mapping g : M — > M is said to be a strategy if, for every t, 

P(H° = #° for a.e. se [0, t]) = 1 and S = S 

implies 

P(J° = jo for a.e. se [0, t]) = 1 andT = f, 

where (J°,T) = g[(H°,S)] and (I°,T) = g[(H°,S)]. The set of all strategies is denoted by T. 
For g £ T, let S(g) = sup HeM S(g[H]). Let 

r = {gef: S(g) < oo}. 



We will use the symbols Y and a for generic control and strategy for the maximizing player, 
and Z and (3 for the minimizing player. If Y = (Y°,K),Z = (Z°,L) G M, we sometimes 
write J(x, Y, Z) = J(x, (Y°, K), (Z°, L)) for J(x, Y°, Z°). Similar conventions will be used for 
X(x, Y, Z) and t(x, Y, Z). Let 

J x (Y,j3) = J(x,Y,f3[Y]), x G G, Y G M, (3 G T, 

J x {a,Z) = J(x,a[Z],Z), x G G, a G T, Z G M. 

Define analogously X x (Y,f3), X x (a, Z), t x (Y,/3) and T x (a, Z) via (jl.8p . Define the lower value 
of the SDG by 

V(x) = inf sup J X (Y,(3), (1.10) 

and the upper value by 

U{x) = sup inf J x (a,Z). (1.11) 

aer zeM 

The game is said to have a value if U = V. 

Recall that the infinity-Laplacian is defined by A^f = p'Up/\p\ 2 , where / is a C 2 function, 
p = Df and E = D 2 f, provided that p ^ 0. Thus Ax>/ is equal to the second derivative in the 
direction of the gradient. In the special case where D 2 f(x) is of the form XI m for some real A, 
it is therefore natural to define A 00 f(x) = A even if Df(x) = 0]. This will be reflected in 
the definition of viscosity solutions of (jl.ip , that we state below. Let 

V = {(0, \I m ) G R m x S(m) : A G R}, 
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V x = (R m \ {0}) x §(m) 



23 = 23 U Pi 



and 



2A 



(p,E) = (0,A/ m ) G 23b, 



p'i7p 



Definition 1.2. A continuous function u : G — > M is said to be a viscosity supersolution [resp., 
subsolution] of (jl.ip . if 

(i) for every x £ G and if S C 2 (G) for which (p, U) := (Dip(x), D 2 (p(x)) G 23, and u — (p has a 
global minimum [maximum] on G at x, one has 



and 

(ii) u = g on dG. 

A viscosity solution is a function which is both a super- and a subsolution. 

The following is the main result. 

Theorem 1.1. The functions U and V are both viscosity solutions to equation (jl.ip . Conse- 
quently, the SDG has a value. 

In what follows we use the terms subsolution, supersolution and solution as shorthand for 
viscosity subsolution, etc. 



1.3. Discussion 

We describe here our approach to proving the main result, and mention some obstacles in 
extending it. 

A common approach to showing solvability of Bellman- Isaacs (BI) equations ( (jl.ip can be 
viewed as such an equation due to (|1.4p ) by the associated value function, is by proving that 
the value function satisfies a dynamic programming principle (DPP). Roughly speaking, this 
is an equation expressing the fact that, rather than attempting to maximize their profit by 
considering directly the payoff functional, the players may consider the payoff incurred up 
to a time t plus the value function evaluated at the position Xt that the state reaches at 
that time. Although in a single player setting (i.e. in pure control problems) DPP are well 
understood, game theoretic settings as in this paper are significantly harder. In particular, 
as we shall shortly point out, there are some basic open problems related to such DPP. In 
a setting with a finite time horizon, Fleming and Souganidis 0] established a DPP based 
on careful discretization and approximation arguments. We have been unable to carry out a 



A(p, E) - h(x) > 



[< o]; 



(1.12) 
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similar proof in the current setting, which includes a payoff given in terms of an exit time, 
degenerate diffusion and unbounded controls. 



Swiech HI] has developed an alternative approach to the above problem that relies on 



existence of solutions. Instead of establishing a DPP for the value function, the idea of ll| 



is to show that any solution must satisfy a DPP. To see what is meant by such a DPP and 
how it is used, consider the equation, — 2A 00 u + Xu = h in G, u = g on dG, where A > is a 



constant, associated with the payoff in (|1.9j) modified by a discount factor. Assume that one 
can show that whenever u and v are sub- and supersolutions, respectively, then 



u(x) < sup inf E 



vix) > sup inf E 



-As 



e- Xs h(X s )ds + e- Aa v(X a ) 



h(X s )ds + e- Xa u(X a 



(1.13) 



(1.14) 



for X = X[x, a[Z], Z], r = r[x,a[Z], Z] and a = a(t) = r At. Sending t — > oo in the above 
equations, one would formally obtain 



u(x) < sup inf E 



e- Xs h(X s )ds + e~ AT g(X T ) < v{x) 



At, 



(1.15) 



in particular yielding that if u = v is a solution to the equation then it must equal the upper 
value function. This would establish unique solvability of the equation by the upper value 
function, provided there exists a solution. In the case A > 0, justifying the above formal limit 
is straightforward (see [ill]) but the case A = 0, as in our setting, requires a more careful 
argument. Our proofs exploit the uniform positivity of h due to which the minimizing player 
will not allow r to be too large. This leads to uniform estimates on the decay of P{t > t) as 
t — > oo, from which an inequality as in (jl . 15[) follows readily. This discussion also explains why 
we are unable to treat the case h = 0. 



Establishing DPP as in (|1.13p . (|1.14p is thus a key ingredient in this approach. For a class 
of BI equations, defined on all of M. m , for which the associated game has a bounded action set 
and a fixed, finite time horizon, such a DPP was proved in Swiech ll|. In the current paper, 
although we do not establish (|1.13j) . ()1.14|) in the above form, we derive similar inequalities 
(for A = 0) for a related bounded action game, defined on G. The characterization of the value 
function for the original unbounded action game is then treated by taking suitable limits. 



Both [6|] and [H| require some assumptions on the sample space and underlying filtration. In 
the underlying filtration is the one generated by the driving Brownian motion. The approach 
taken in [ll|, which the current paper follows, allows for a general filtration as long as it is rich 
enough to support an m-dimensional Brownian motion, independent of the Brownian motion 
driving the state process. The reason for imposing this requirement in ll| is that inequalities 
similar to (11.130 and (|1.14p are proved by first establishing them for a game associated with 
a non-degenerate elliptic equation, and then taking a vanishing viscosity limit. This technical 
issue is the reason for including the auxiliary process W in our formulation as well. As pointed 



out in 



111 ], the question of validity of the DPP and the characterization of the value as the 
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unique solution to the PDE, under an arbitrary nitration, remains a basic open problem on 



Both the unboundedness of the action space and the combination of deg eneracy of the 
dynamics and an exit time criterion make it hard to adapt the results of to our setting. 
In order to overcome the first difficulty we approximate the original SDG by a sequence of 
games with bounded action spaces, that are more readily analyzed. For the bounded action 
game, existence of solutions to the upper and lower BI equations follow from We show that 
the solutions to these equations satisfy a DPP similar to f)1.13j) — (|1.14|) (Proposition 14. As 
discussed above, existence of solutions along with the DPP yields the characterization of these 
solutions as the corresponding value functions. Next, as we show in Lemma 12. 5} the upper 
and lower value functions for the bounded action games approach the corresponding value 
functions of the original game, pointwise, as the bounds approach oo. Moreover, in Lemma 
12.41 we show that any uniform subsequential limit, as the bounds approach oo, of solutions 
to the BI equation for bounded action games is a viscosity solution of (jl.lj) . The last piece 
in the proof of the main result is then showing existence of uniform (subsequential) limits. 
This is established in Theorem 12.11 by proving equicontinuity, in the parameters governing the 
bounds, of the value functions for bounded action games. The proof of equicontinuity is the 
most technical part of this paper and the main place where the C 2 assumption on the domain 
is used. This is also the place where the possibility of degenerate dynamics close to the exit 
time needs to be carefully analyzed. 

The rest of this paper is organized as follows. In Section [2] we prove Theorem 1 1 . 1 1 based on 
results on BI equations for bounded action SDG. These results are established in Sections [3] 
(equicontinuity of the value functions) and 0] (relating the value function to the PDE). Finally, 
it is natural to ask whether the state process, obtained under (^-optimal play by both players, 
converges in law as 5 tends to zero. Section [5] contains a discussion on this issue. 



2. Relation to Bellman-Isaacs equation 

In this section we prove Theorem 11.11 by relating the value functions U and V to value func- 
tions of SDG with bounded action sets, and similarly, the solution to (jl.ip to that of the 
corresponding Bellman-Isaacs equations. 

Let p £ M m , p / and S £ S(m) be given, and, for n £ N, fix p n 6 M m , p n ^ and 
S n £ S(m), such that p n — ► p, S n — > S. Denote p = p/\p\ and p n = p n /\Pn\- Let {k n } and {l n } 
be positive, increasing sequences such that k n — ► oo, l n — ► oo. 



SDGs. 



Denote 




(2.1) 



and let 



max min &(a,, b, 

|6|=l,0<d</ \a\=l,0<c<k 



c, d;p, S) 



(2.2) 
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Al,(p,S) = min max <P(a,b,c,d;p,S). (2.3) 

w |o|=l,0<c<fc |6|=l,0<d<i 

Set 

A+(p,S)=Al ln (p,S), A-(p,S)=A- knln {p,S). 
Lemma 2.1. One has A^(p n , S n ) — ► yl(p, 5), and A~(p n , S n ) — ► A(p, S), as n — ► oo. 



Proof. We prove only the statement regarding yl n , since the other statement can be proved 
analogously. We omit the superscript '— ' from the notation. Denote <P n (a,b,c,d) 
= <P(a,b,c,d;p n ,S n ). Let 

A n (a,c) = max <P n (a,b,c,d) 
\b\=l,0<d<l n 

Let (a*,c*) be such that yl* := A n (p n ,S n ) = /l n (a*,c*). Note that A* n < A n (p n ,0), which 
is bounded from above as n — > cxd, since {b + p n ) ■ p n > for all 6 G 5 m_1 , n > 1. On 
the other hand, if for some fixed e > 0, a* • p n < \p n \ — e holds for infinitely many n, then 
limsup /l n (a*, c n ) = oo for any choice of c n contradicting the statement that A* n is bounded 
from above. This shows, for every e > 0, 

\Pn\ ~£ <a* n -p n < \Pn\ 

for all large n. In particular, o* — > p. Next note that 

7i; = A n (a* n ,c* n ) ><P n (-p n ,a* n ,l n ,c* n ) > -\{Pn + a* n )'S n (p n + a* n ) 

hence 

liminf Al > -2p'Sp = A(p, S). 
Also, with (b n ,d n ) £ arg max (M) <P n (b,p n , d, k n ), 

A* n = A n (a* n , c* ) < A n (p n , k n ) = --(6 n - p n )'S n (b n - p n ) - (d n + fc n )(6 n + p n ) • p n 

< -^{K - p n )'S n (b n - p n ). (2.4) 

If b n — > — p does not hold then liminf /l* = — oo by the first line of (|2.4p which contradicts the 
previous display. This shows b n — > —p. Hence from the second line of (|2.4p 

limsupyi; < -2p'Sp = A{p, S). 

□ 



We now consider two formulations of SDG with bounded controls, the first being based on 
Definition 11.11 whereas the second is more standard. For k, I G N let 

M k = {Y G M : S(Y) < k}, 

r l = {per: s(/3) < /}. 
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Define accordingly the lower value 



and the upper value 



V kl (x) = inf sup J x (Y,p), (2.5) 
/3er ; Y&M k 



U k i(x) = sup inf J x (a,Z). (2.6) 

a& r k z ^ M i 



Definition 2.1. (i) A process {Ht} taking values in 7i is said to be a simple admissible control 
if it is {Ft} -progressively measurable. We denote by M° the set of all simple admissible controls, 
and let M° = {H G M° : S{H) < k}. 

(ii) Given k,l G N, we say that a mapping g : — > M^ is a simple strategy, and write g G Jj^ 
z/ ; /or every i, 

P(iT s = # s /or a.e. s G [0,t]) = 1 

zmp/ies 

= Aw a-e.se [0,t]) = 1. 

For (3 G T° kl ,Y G M%, we write J X (Y, f3(Y)) as J x (Y,f3). For q G 1^,2 G M°, J*{a,Z) is 
defined similarly. 

For fc, Z G N, let 

V k ° l (x)= inf sup J*{Y,0), (2.7) 

£/£,(*) = sup inf J x (a,Z). (2.8) 
aer o zeMf 

The following shows that the two formulations are equivalent. 
Lemma 2.2. For every k,l, = Vm and = Uti- 

Proof. We only show the claim regarding V^i- Let /3 G Define 0° G r£ by letting, for 
every Y G M^, be the process component of the pair (3[(Y,k)]. Clearly, for every Y G 

Ml J x ((Y,k),P) = J x (Y,p°), whence sup yeMfc J x (Y,/3) > sup y<EM o J X (Y,(3°), and V kl (x) > 

Next let (3° G 1%. Define £ : M -»■ M z as follows. Given F = (Y°,1T) = (A^C, IT) G M, let 
y fc = (A, C Ak), and set /3[Y] = (P°[Y k ], I). Note that if, for some K, Y° and Y° are elements 
of M% and F°(a) = Y°(s) on [0,t] then Y fc (s) = Y k (s) on [0,t] and so (3°[Y k ] s = (3°[Y k ] s 
on [0,t]. By definition of f3, it follows that (3 G J}. Also, if (Y°,1T) G M fc then K < k and 
thus ^((F ,^),/?) = J x (y°,/3°). This shows that sup ygMfc J X {Y,(3) < su Py o eM o J x (y°, (3°). 

Consequently Vki(x) < V^(x). □ 

Denote V n = Vk n i n and U n = Uk n i n ■ The following result is proved in Section [3l 

Theorem 2.1. For some uq G N, the family {V n ;n > no} is equicontinuous, and so is the 
family {U n ;n > n }. 
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Consider the Bellman-Isaacs equations for the upper and, respectively, lower values of the 
game with bounded controls, namely 

'A+{Du,D 2 u) - h = iaG, 

(2.9) 

u = g on dG. 

A-{Du,D 2 u)-h = iaG, 

(2.10) 

u = g on dG. 

Solutions to these equations are defined analogously to Definition 11.21 with replacing A, 
and where there is no restriction on the derivatives of the test function, i.e., T> is replaced with 
R m x S(m). 

Lemma 2.3. There exists n\ G N such that for each n > n\, U n is the unique solution to 
(|2.9p . and V n is the unique solution to (|2.10p . 

Proof. This follows from a more general result, Theorem 14. II in Section [H □ 
Lemma 2.4. Any subsequential uniform limit ofU n or V n is a solution of (jl.lj) . 



Proof. Denote by Uq [resp., Vb] a subsequential limit of U n [V n ]. By relabeling, we assume with- 
out loss that U n [resp., V n ] converges to Uq [Vb]. We will show that Uq and Vb are subsolutions 
of (jl.ip . The proof that these are supersolutions is parallel. 

We start with the proof that Uq is a subsolution. Fix xq G G. Let 99 G C 2 (G) be such that 
Uq — ip is strictly maximized at xq. Assume first that Dtp(xo) 7^ 0. Since U n — * Uq uniformly, 
we can find {x n } C G, x n — > xq, where x n is a local maximum of U n — (p for n > N. We take 
./V to be larger than n\ of Lemma 12.31 Since by Lemma 12.31 U n is a subsolution of (|2,9p . we 
have that for n > N 

A+(Dv(x n ),D 2 <p{x n )) - h(x n ) < 0. 

Thus by Lemma l2.lt 

A(D<p(xo),D 2 <p(xo))-h(x ) < 0, 

as required. 

Next, assume that Dip(xa) = and D 2 (p(xq) = \I m for some A 6 1. In particular, <p(x) = 
tp(xo) + 4|x — xq\ 2 + o(\x — xq\ 2 ). We need to show that 

- 2A - h(x ) < 0. (2.11) 

Consider the case A > 0. Fix 5 > and let = — xo\ 2 . Then Z/q — ^6 nas a strict 

maximum at x . Since f/ n — ► [/q uniformly, we can find {x n } C G, x n — > xo, where x n is a 
local maximum of U n — To prove (|2.1ip it suffices to show that for each e > 0, 

-2(A + <5)- sup h(x)<0. (2.12) 
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To prove (|2.12p . argue by contradiction and assume that it fails. Then there exists e > such 
that 

-2(A + £)- sup h(x)>0. (2.13) 

x& £ (x ) 

Let N > ni be such that |x n — xq\ < e for all n > N. Since £7 n is a subsolution of (I2.9H . 



M„ := A+(D^s(x n ),D 2 ^ s (x n )) < h(x n ). (2.14) 

Also, 

1 



u n = max mm 

\b\=l,0<d<l n |a|=l,0<c<fc„ 



-(A + S)\a- b\ 2 - (A + tf)(c + d)(a + 6) • (ar„ - s ) 



> min 

|a|=l,0<c<fc„ 



i(A + <y)|a - b n \ 2 ] = -2(A + <5) (2.15) 



2 

where 6 n = — (x n — xq) /\x n — xq\ ii x n ^ xq and arbitrary otherwise. Thus by (|2.13p . 

fj, n > h(x n ). (2.16) 
However, this contradicts (12. 14h . Hence f)2. 12H holds and so (|2.11j) follows. 

Consider now the case A < 0. Let 5 > be such that A + S < 0. Let t/jg be as above. Then 
Uq ~ ips has a strict maximum at xq. Fix e > 0. Then one can find 7 < e such that 

U (x ) = U (x Q ) - if)s(xo) > U (x) - if) S (x), V < \x - x Q \ < 7. (2.17) 

Thus one can find r\ E M m such that < |r/| < 7 and 

U (x ) > U (x) - ip s (x + V) v x e <9B 7 (x ). (2.18) 



Let vpd,r/(x) = ips{x + rj). Let x^ G B 7 (xo) be a maximum point for J7o — ips^ over B 7 (xo). We 
claim that 

x v <9B 7 (xo) and x v 7^ xo — 77. (2.19) 
Suppose the claim holds. Then Dips jri (x v ) 7^ 0, and so from the first part of the proof, 

-2(A + 5) - h{x v ) = A{D^ s , v {x v ),D 2 ^ v (x v )) - h{x v ) < 0. 

Since \x v — xq\ < 7 < e, sending e — > and then J — » yields (|2.1ip . 

We now prove (l2~T9l) . From ([HE]) and the fact that A + 5 < 0, 

sup [U Q (x) - ips, v (x)] < U (x ) < U (x ) - ips, v (xo)- 



Hence x v <9B 7 (xo). Also 

U (x -rj)- tp6,r]{xo -rj) = U (x - rj) < U (x ) + ips(xo - rj) < U (x ) - ips, v (xo), 

where we used f|2. 1T|) and the negativity of the functions ips an d ips,rj- This shows that x v 7^ 
xo — rj, and (12.19P follows. This completes the proof that Uq is a subsolution of (jl.ip . 
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Finally, the argument for Vq differs only at one point. If we had (V n , A~) instead of (U n , A+) 
then instead of (I2.15P we could write 

1 



a n = mm max 

\a\ = l,0<c<k n |6|=l,0<rf<J r 



-(A + 5)\a- b\ 2 - (A + 5)(c + d)(a + b) ■ (x n - x ) 



max 

\b\=lfl<d<l r , 



-(A + <5)|a n - b\ 2 - (A + 6)(cn + d)(a n + b) ■ (x n - x ) 



where (a n ,c n ) achieves the minimum, and then by choosing (6, d) = (— a n ,0), 

/in > -2(A + 5). 

Hence (|2.16p is still true. Rest of the argument for the subsolution property of Vq follows as 
that for Uq. □ 

Lemma 2.5. Fix x £ G. 

(i) One can choose (k n ,l n ) in such a way that limsupyj^^ V n (x) < V(x). 

(ii) One can choose (k n ,l n ) in such a way that liminfn^oo V n (x) > V(x). 
Similar statements hold for U n (x) and U(x). 

Proof. We prove (i) and (ii). The statements regarding U n (x) and U(x) are proved analogously. 

(i) Fix k. Since r = Uz>i we have that given e, 

V{x) > inf sup J X {Y,P) 
Per YeM k 

> inf sup J x (Y,p)-e 
~ Per lYe M k 

= V k i(x) - e, 

for all I sufficiently large. This shows V(x) > limsup^^ V k i(x), and (i) follows. 

(ii) Fix e. For each (k, I) G N 2 let p ki G J] be such that 

sup J x {Y,p kl ) < inf sup J x (Y,(3)+e. (2.20) 

YeM fe P£ r i YeM k 

Fix I. Let Pi be defined by 

PAY] = p kl [Y], YeM k \ M fe _!, kN, 

where we define Mo to be the empty set. Then Pi G Since M = Ufc>i M k , we have that the 
following holds provided that k is sufficiently large 

V(x) < sup J x (Y,pi) 

< sup J x (y,fii) + e 
YeM k 

= max sup J X (Y, Pji) + e 

< inf sup J X (Y,P) + 2e, 
/3er ; YeM k 
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where the last inequality follows from (|2.20p . This shows that, for every I, V(x) < liming Vki(x). 
The result follows. □ 

Proof of Theorem II. 1L The statement that U and V are solutions of (II. lj) follows from 
Theorem 12.11 Lemmas 12.31 12.41 and uniqueness of solutions of (jl.ip . established in [91]. The 
latter result also yields U = V. □ 



3. Equicontinuity 



In this section we prove Theorem 12.11 With an eye towards estimates needed in Section [J] we 
will consider a somewhat more general setting. Thanks to Lemma [2.21 we may, and will use the 
value functions (12. 7h , (12. 8j) , defined using simple controls and strategies (Definition 12. 1|) . Given 
X defined as in CCS} for some Y,Z E M°, we let for 7 6 [0, 1), X 7 = X + 7W. Define r 7 and 
J 7 as below f)l .7[) but with X replaced with X 1 . Also denote by and the expressions 
in <|22D, H2IS) with J replaced with J 7 . We write t/ 7 = £7^ v F 7 = V^. Theorem O is an 
immediate consequence of the following more general result. 

Theorem 3.1. For some £ N, the family {V^ , C/ 7 ; n > n,2, 7 G [0, 1)} is equicontinuous. 



In what follows, we will suppress 7 from the notation unless there is a scope for confusion. We 
start by showing that the value functions are uniformly bounded. To this end, fix a G S m ~ l , 
and note that the constant process Y° := (o°, 1) is in M°. 

Lemma 3.1. There exists a constant c\ < 00 such that 

e[t(x, y°, zf] < ci, x e G, z e M°, 7 e [o, l). 



Proof. We only present the proof for the case 7 = 0. The general case follows upon minor 
modifications. Denote by mo the diameter of G. Fix T > uiq. By (jl.6p . with = a • on 
the event r > T one has 

[ T (l - a s )dW s + / T (l + a s )ds < a° ■ (X T - X ) < m . 
Jo Jo 

Consider the {^}-martingale, M t = J (l — a s )dW s , with (M) t = f Q (l — a s ) 2 ds. On the event 
(M) T <T, 

[ (1 + a s )ds = 2T- [ (l-a s )ds>T. 
Jo Jo 

So on the set {(M) T < T; r > T} we have \M T \ >T-m . Letting a = inf{s : (M) s > T}, 

P(r > T) (M)t <T)< P(\M TAa \ > T - mo) < < w (3.1) 

(T - to ) 4 (T - to ) 4 

We now consider the event {r > T; {M)j- > T}. One can find TO2,to-3 G (0, cxd) such that for 
all non decreasing, non negative processes {7*}, 

P(# s + % e (-too, too); < s < T) < TO 2 e" m3T , (3.2) 
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where if is a one dimensional Brownian motion. Letting jt = Jo(l + D s )(l + a s )ds, where 
Z = (B, D), we see that 

{r > T; (M) T > T} c {M s + 7s G (-mo, mo), < s < T; (M) T > T}. 

For u > 0, let = inf{s : (M) s > u}. Then, with % = 7s s , 

P(t > T; (M) T >T)< P(H S + %£ (-m , m ); < s < T) < m 2 e- m3T , 

where the last inequality follows from (|3.2p . The result now follows on combining the above 
display with (|3.ip □ 

The inequality J(x,Y° , Z) < \h\ 00 E{r(x^ Y°, Z)) + |g|oo, where |/i|oo = sup x |/i(x)| and 
\g\oo = sup x |c/(x)|, immediately implies the following. 

Corollary 3.1. There exists a constant c 2 < oo such that \ V^{x)\ V |C/^(x)| < C2, for all x £ G, 
7 € [0, 1) and n G N. 



The idea of the proof of equicontinuity, explained in a heuristic manner, is as follows. Let x\ 
and X2 be in G, let e = \xi — X2I, and let 5 > 0. Consider the game with bounded controls for 
which V n is the lower value function, for some n G N. Let the minimizing player select a strategy 
(3 n that is 5-optimal for the initial position x\\ namely supy GM o J Xl (Y, f3 n ) < V n (xi) + 5. 

Denote the exit time by t\ = t x1 (Y, (5 n ) and the exit position by £1 = X x1 (t\). Now, modify 
the strategy is such a way that the resulting control Z = (3 n [Y] is only affected for times t>T\. 
This way, the payoff incurred remains unchanged. Thus, denoting the modified strategy by /3 n , 
we have, for every Y G M° , 

J xl (Y,(3 n )<V n ( Xl ) + 5. 

Given a point £2 located inside G, e away from £1, and a new state process which, at time t\ 
is located at £2, the modified strategy attempts to force this process to exit the domain soon 
after t\ and with a small displacement from £2 (provided that e is small). 

Let now the maximizing player select a control Y n that is 5-optimal for playing against f3 n , 
when starting from X2- This control is modified after the exit time T X2 (Y n ,p n ) in a similar 
manner to the above. Denoting the modified control by Y n , we have 

V n (x 2 )< J x *(Y n J n ) + 5. 

Hence V n (x 2 ) - V n (xi) < J X2 (Y n ,/3 n ) - J X1 (Y n , f3 n ) + 25. One can thus estimate the modulus 
of continuity of V n by analyzing the payoff incurred when (Y n ,f3 n ) is played, considering 
simultaneously two state processes, starting from x\ and x 2 . The form (jl.6p of the dynamics 
ensures that the processes remain at relative position x\ — x 2 until, at time <r, one of them 
leaves the domain. The difference between the running payoffs incurred up to that time can 
be estimated in terms of e, the modulus of continuity of h, and the expectation of a. It is not 
hard to see that the latter is uniformly bounded, owing to Corollary 13.11 and the boundedness 
of h away from zero. By construction, one of the players will now attempt to force the state 
process that is still in G to exit. If one can ensure that exit occurs soon after a and with a 
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small displacement (uniformly in n), then the running payoff incurred between time a and the 
exit time is small, and the difference between the terminal payoffs is bounded in terms of e 
and the modulus of continuity of g, resulting in an estimate that is uniform in n. 

This argument is made precise in the proof of the theorem. Lemmas 13.21 and 13.31 provide 
the main tools for showing that starting at a state near the boundary, each player may force 
exit within a short time and with a small displacement. To state these lemmas we first need 
to introduce some notation. 

We have assumed that G is a bounded C 2 domain in M. m . Thus there exist p G (0, |), k G N, 
Zj G dG, Ej G 0{m), G C 2 (R m - 1 ), j = 1, . . . , k, such that, with Bj = M p (zj), j = 1 . . . , k, 
one has dG C U* =1 B,-, and 

G n Bj = {E jV : yi > 0(2/2, ••-,2/m)}nB j , j = l,...,k. 

Here 0(m) is the space ofmxm orthonormal matrices. Define for j = 1, . . . , k, <j5j ■ M m — * M 
as 

<Pj(y) = yi- €j(y2, ■ ■ ■ , ym), y g m" 1 . 

Let tpj(x) = tpjiE^x), x e R m . Then \Dtpj(x)\ >l,xe K m , j = 1,... ,k. Furthermore, 

G n Mj = {x : ipj(x) > 0} n Bj, j = 1, . . . , k. 
Let < po < p be such that <9G C Uj =1 B po (zj). For e > 0, denote 

X e = {(xi, x 2 ) : xi € 9G, x 2 € G, \xi - x 2 \ < e}. 
Let j : dG — ► {1, . . . , k} be a measurable map with the property 

x £ Bpo^Vj.)), for all x G OG. 
For existence of such a map see, for example, Theorem 10.1 of jjj. Then, for every e < pi := 

4 ' 



(xi,X2) G implies B pi (xi) C Bj( a . 1 ), i = 1,2. (3-3) 



For j = 1, . . . , k and x G B^, define : M m ^ M as 



^J (x) = ipj(x) + \x- x \ 2 . 
Also, note that \D^°\ > \ in Bj. Define vrj° : R m -> 5 m_1 such that it is Lipschitz, and 

15*531- (3 ' 4> 



Given a strategy /3, and a point x 2 , we seek a control Y = (A, C) that forces a state process 
starting from x 2 to exit in a short time and with a small displacement from X2 (provided 
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that X2 is close to the boundary). We would like to determine Y via the functions ttj just 
constructed, in such a way that the following relation holds: 

A(t) = n(X(t)), C(t) = c°, (3.5) 

where ir = tt x ? and c° > is some constant. Making A be oriented in the negative direction 
of the gradient of <pj allows us to show that the state is 'pushed' towards the boundary. The 
inclusion of a quadratic term in ipj° ensures in addition that the sub-level sets {ipj < a} 
are contained in a small vicinity of xq, provided smallness of a and dist(xo, dG). The latter 
property enables us to show that the process does not wander a long way along the boundary 
before exiting. 

The difficulty we encounter is that due to the feedback nature of A in (|3.5p we cannot ensure 
(local) existence of solutions of the set of equations (jl.6p and (|3.5p . Take for example a strategy 
(3 that is given as /3[Y]t = b(Yt), Y E M°, where b is some measurable map from TL to TL. Along 
with equation (jl.6p . and (|3.5p this defines X as a solution to an SDE with general measurable 
coefficients. However, as is well known, the SDE may not admit any solution in this generality. 
To overcome this problem we will construct a Y that approximates the Y we seek in (|3.5p via 
a time discretization. 

Let U e denote the collection of all quintuples a = (a, £i, £2, P, Y) such that a is an a.s. finite 
Tt stopping time, £1 and £2 are jFo-measurable random variables satisfying (£1,^2) G X £ a.s., 
P G r°, and Y G M°. Fix 7 G [0, 1). 

Let cr = (a,^\,^2iPiY) G S P1 be given. Let j*(ui) = j(^i(u>)). Denote 

* = •) = ipj*^), * = *(u>, •) = n = n( u , •) = 

We define a sequence of processes (Z^,yW)j>o as follows. Let Y"/ ^ = , ) be given by 

(Y t , t < a, 

y (0) = 

[(77(6),c°), t>a. 

The constant c° above will be chosen later. Denote (B^°\ D^) = P\Y^ ']. Define, for t > cr, 

= 6 + / 1 M ( S )([4 0) " Bf)]dW s + 7 <W S ) + jf [Cf + Df][4°) + J3W]cto. 
The process can be defined arbitrarily for t < a. Set r/o = cr. We now define recursively, 
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for all i > 1, 

Vi = (Vi^i+e)/\mf! [ t>ri^ 1 :\xt 1) -X^\>e or jf* R®ds > t - t&_i}, (3.6) 

where 2$) = [Cf" 1 ) + D^pO^- 1 )) • - n(X^)], 

X ^=xt 1 \ Y®=Yt 1 \ 0<t< Vi , 



Y^^{4\cf) = {n{X^%c% t>Vi, (3.7) 

(£(*),£>») = /?[y (i) ], 

xf = X^ + / ![*,*] (*) ([4° - S«]dW. + 7 dW.) + f[Af + B«][<7« + £>«]cfe, t > 

It is easy to check that r/o < ?7i < • • • and — » oo a.s. Define X^ = X^ , Yj = Yj if t < 
Let p = p\ and 

r p = inf{t > a : <F(X t ) > p}, 

t g = ini{t >a:X t £G c }, (3.8) 
r = r p A r G . 



Define 



It, * < r, 



Y t = (A t ,C t 



(a°,c°), t>r, 
(B,D) = (3(Y), 
where a is as fixed at the beginning of the section. Let 

X t = b + J l M (s)([i s - B s \dW s + 7 dW s ) + J\c s + D S ][A S + B s ]ds. 

We write 

X = X[<t,Z 1 ,Z 2 ,P,Y], ¥ = 



Note that if f p and tg are defined by (|3.8p upon replacing X by X then f := f p Af<3 = r p Arc = 
r, because X differs from X only after time r. We write f = f[a, £i, £2> A Y]. Similar notation 
will be used for f p and f G . 

Lemma 3.2. There exists a c° G (0, 00) and a modulus ■d such that for every e G (0,pi), and 
7 G [0, 1), i/er = (cr,£i,£ 2 ,/?, Y) G E e and f G = f G [cr], one has 

(i)E{f G - a |^ CT } <0(e), 



fit; £{|X - 6|*,r G l-?v} < ^ w/iere |X - £ 2 |*,r G = sup^^ |X(t) - £ 



2 
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Proof of the lemma is provided after the proof of Theorem 13.11 

Next we construct a strategy (3* G r° with analogous properties. Here, existence of solutions 
is not an issue, and discretization is not needed. 

Fix (x 1 ,x 2 ) G X pi . Let j = j(a?i), & = , and 6 = vrf . Given Y" = (A,C) G M°, let X 
solve ^ ^ 

X t = x 2 + J ([A s - U(X s )]dW s + 7 d^ s ) + J Q [C S + d°][A s + n(X s )]ds, 
where d° is a constant to be determined later. Let 

t p = m£{t : l(i t ) > p}, 

r G = inf{t : X t G G c }, (3.9) 
f = T p A r G . 

Define Z* G M° as 

f(n(X s ),d°), s<r, 

[(a°,d°), s>r. 
Note that /3*[Y](s) := Z s , s > defines a strategy. Let 

X* t = x 2 + J o (i A s - B* s ]dW s + 7 dW s ) + y* [C. + d°] [A s + B*]ds. 

Define rf, r G and r* by replacing X with X* in (j3.9|) . and note that r* = r. To make the 
dependence explicit we write 

X* = X*[x 1 ,x 2 ,Y,W], Z* = Z*[ Xl ,x 2 ,Y,W], t* = t*[ Xi ,x 2 ,Y,W]. 

Lemma 3.3. There exists d° G (0, oo) and a modulus $ such that for all e G (0, pi), Y G M°, 
(x\,x 2 ) G X £ , if Tq = Tq[xi,x 2 ,Y,W], one has 

(i) E[tq] < 0(e), 

(ii) E[\X* - x 2 \\ T *} < 0(e), where \X* - x 2 |*, T * = sup ie[0ir *] \X*(t) - x 2 \. 

The proof of Lemma 13.31 is very similar to (in fact somewhat simpler than) the proof of 
Lemma 13.21 and therefore will be omitted. 



If a is an a.s. finite {^jj-stopping time and (^1,^2) are .Tv-measurable random variables 
such that (£i,£ 2 ) G X Pl a.s., then we define the Qt = J^t+a adapted processes 

X* t =X*[£ 1 ,bX,Wa](t), 
Z* = Z*[^^ 2 XM](t), 
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Fig 1. 



where Y a (t) = Y(t + a) and W a (t) = W(t+o) — W(a), t > 0. To make the dependence explicit, 
write 

X*=X*[a,Z 1 ,&,Y], Z* = Z*[a, 



Proof of Theorem 12.11 Fix xi,x 2 G G and 7 G [0, 1). We will suppress 7 from the notation. 
Assume that \x\ — x 2 \ = e < p±, so that Lemmas 13. 2\ 13.31 are in force. Let no be large enough 
so that l n , k n > max{c°, d } for all n > no- Given 5 G (0, 1) and n > no, let (3 n £ l be such 
that 

sup j^/r)-^ <k(xi) <ci. 

For Y G M° write r 1 ' n (y) := r^Y,/?") and X l > n (Y) := X Xl (Y,(3 n ). Note that 

hE[ T l ' n {Y)\ - < Cl + 1, 

hence for every n and every Y G , 

where mi < 00 is a constant that does not depend on n. 
Define /3 n G TP , as follows. For Y G M°, let 



£> n (Y) = XXnnnOn, & n (Y) = ti n (Y) +x 2 - x u 



-l,n 



(P n [Y]t 



t < r 1 ' n (Y), 



(3 n [Y]t = <Z*[r^(Y),e{ n (Y),e 2 ' n (Y),Y}, t > r^(Y), ^ n (Y) G (5, 



(3.10) 



t arbitrarily defined, i > r 1 ' n (Y), £' n (y) G G - 
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Note that for every Y G M° n , J X1 (Y, /3 n ) = J xl (Y,(3 n ). Next, choose Y n G M% n such that 

v n (x 2 )< sup j^(y,^) < r*(Y n J n ) + 8. 

Let r 2 ' n = T X2 (Y n ,(3 n ), and X 2 > n = 1^(7", /3 n ). Let 



£ 2 =^ T 2,n, £l' =£ 2 +Xl-X 2 . 



Define Y n G M,° as 



arbitrarily defined, t > T 2,n , £ 2,n G (5 C . 

Note that J X2 (Y n ,(3 n ) = J X2 (Y n ,p n ). Thus 

V n (x 2 ) - V n ( Xl ) -25< J x *(Y n ,p n ) - J x i(Y n J n ). (3.11) 

For k = 1,2, let 

a fe ' n = r Xfc (F n , ( a n ), X k ' n = X Xk (Y n ,(3 n ), ~ k > n = X*' k n n . 

For m-o > let i? 9 (mo) = sup{|g(x) — g(y)\ : x,y G <9G, |x — y| < mo} and 4(wo) = 
sup{|/i(x) — h(y)\ : x,y £ G,\x — y\ < mo}. Using (13,10p . the r.h.s. of (13. lip can be bounded 
by 

EVg^S 1 '" - ~ 2 > n \) + c 3 ^(e) + |/i|oo-E[(^' n V a 2 '™) - (a 1 '™ A a 2 '")]. (3.12) 
On the set a 1 ' 11 < a 2 ' n we have IS 1 '" - ~ 2 > n \ <e + \X 2 ^ n - Xj n „|. Hence by Lemma EP), 

E[\S^-S 2 ^\ 2 l W ,n^ n} ] <M^, 

for some modulus Using Lemma 13.2( h). a similar estimate holds on the complement set, 
and consequently, the first term of (|3.12j) is bounded by $2(^)5 for some modulus $2- By 
Lemmas I3.2f i) and I3.3f i). the last term of (|3.12p is bounded by |/i|oo($(e) + $(£))■ Hence 
Vn{x2) — U n (xi) < 25 + t?3(|xi — X2I) for some modulus $3, and the equicontinuity of {V^; n, 7} 
follows on sending 5 — > 0. The proof of equicontinuity of {U2',n,j} is similar, and therefore 
omitted. □ 

Proof of Lemma 13.21 We will only present the proof for the case 7 = 0. The general case 
follows upon minor modifications. Denote 

■*/>l,oo= SUP SUp \Dl/)j°(x)\, "02,00= SUp_SUp|D 2 0j°(x)|, 
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and let <pi )O0 , ^2,00 be defined analogously. Let e > and a = (a,^i,^2, P,Y) £ U £ be given, 
let X = X[o\, Y = Y[cr], f p = f p [cr], tq = tq[(t], and f = f[cr]. Let 

To = inf{t > a : W(X t ) < 0}, f B = M{t > a : X t Bj*}, 

where we recall that j* = i(£i). We have f = f p A tg < f p A fo, because & > <P. Also, by (|3.3p . 
f <fg. By Ito's formula, for t > a, 

¥(X t ) = + / i M ( s )^(^)[^ - 

+ /W(A > s )[^ + J B s ][C s + ^s + ^ / [A a — 5 s ]'D 2 !f'(X s )[A s — I? s ]ds. 

J cr ^ J a 

For i < r, using and (f377|) . 

DV(X S )[A S - B s ] = DV{X S )[II(X S ) - B s ] + D$(X a )[A 9 - II(X S )} 
= -\D¥(X a )\(l-a a -6 a ), 

where a s = -\D$(X S )\- 1 DV{X S ) ■ B s , and 5 S = -II{X S ) ■ [A s - II(X S )]. Note that \a s \ < 1. 
Moreover, using the inequality |A • (A — A)| < 2|w|~ 1 |u — «| along with (|3.6p . recalling the 
definition of A and the fact {D&^X^l > 1/2, we see that 

\$s\< 46^2,00 • 

Furthermore, 

£><F(X S )L4 S + B S ][C S + A*] = DV(X s )[n(X s ) + £ S ][C S + A.] + DV(X S )[A S - II{X S )}[C S + D s ] 

= -\LW(X B )\ (1 + a s )(c° + D s ) + e s , 



where, by (|3.6p . for cr < ti < £2 < r 

rf 2 



e s ds < e + i2 — 



ti 



Finally, we can estimate 



p s := ~[A S - B S ]'D 2 V{X S )[A S - B s ] 
by \p s \ < 2^2, oo- Shifting time by a, we denote Qt = Ft+a, Wt = Wt+ a — W a , and 

{X t , D t , a t , S t , e t , ft ) = (X+<r , A+<j , +<j , S t+(7 , e t+(J , p i+cr ) . 
Denote also m t = \DHf(X s )\, let M be the ^-martingale 

M t = - f m,(l - a s - 5 s )dW s , 
Jo 



and set 



Mt := (M)t = f m 2 s (l -a s - 5 s ) 2 ds, (3.13) 
Jo 
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Pt = - [ m s {l + a s )(c° + D s )ds, Q t = f (e s + p s )ds, & t = V(X t ). 
Jo Jo 

Combining the above estimates, we have for 0<s<t<f — a, 

V t = V Q + M t + P t + Q t (3.14) 

Qt-Qs <s + r(t-s), (3.15) 

where r = 2ip2, oo + 1- Note that mt > 1/2 for s < tb — cr, and recall that tb > f = f p A tq. 
We have for t < f — a, assuming without loss of generality ^£1^2,00 < 1/32, 

r = r - (1 - a t - S t + 1 + a t + 5 t ) 2 < T -{1 - a t - kf + 2r(l + a t + S t ) 
< 2rmf(l -a t - 5 t f + 4rm t (l + a t ) + 2r^, 

and 



1 .. .. . . 1 

2 



& < (1 + a t ) + -{l-a t - 5 t ) 2 < 2m t (l + a t ) + -m t 2 (l - a t - k? ■ 



Hence for t < r — a, we have r < 3rm^(l — at — 5t) 2 + 8rmt(l + at). Thus by (|3.14p . if c° is 
chosen larger than 8r, we have 

&t = % + M t + 3r^ + P t + Q t , (3.16) 

where 

rt 

Pt = ~ m s (l + a s )(c° + D s -8r)ds, 
Jo 

Qo = 0, and 

Pt ~ P s < 0, Qt-Q s <£, 0<s<t<f-a. (3.17) 

We will write P for f [• \Gq\, and £ for the respective conditional expectation. 
The proof will proceed in several steps. 
Step 1. For some v\ G (0, 00) 

sup E[(f — a) 2 ] < u\, a.s.. 

Step 2. For some ^2 S (0, 00) 

sup E[(f G - a) 2 ] < u 2 , a.s.. 

Note that Step 2 is immediate from Step 1 and Lemma [3J] because by construction, a constant 
control is used after time f . 
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Step 3. There exists a modulus $1 such that 

sup P[f G -a> ^(e), f p > f G ] < i?i(e), e > 0. 

Step 4. There exists a modulus $2 such that 

sup P[f p < f G ] < tf 2 (e), e > 0. 

Based on these steps, Part (i) of the lemma is established as follows. Writing E e for the 
event f G — o > i?i(e), 

E[f G -a}= E[(f G - a)l Ee ] + E[(f G - a)l E g] 
< [E[(f G -a) 2 ]P(E £ )] 1/2 + Me) 

where the first inequality uses Cauchy- Schwartz, and the second uses Steps 2, 3 and 4. 
To show Part (ii) of the lemma, use Steps 3 and 4 to write 

E[\X - 6|^ G ] < E[\X - £ 2 |2 fG l Wp>fG} ] + [Me) + Me)] diam(G) 2 . (3.18) 
By (I3.14p and A3. 15f> we can estimate 

E[ sup &(X t )l E c n{fp>fG} ] <^ li0o£ + 3^ 1 (e) 1 / 2 ^, £ + n? 1 (e)+e. (3.19) 

te[a,T G ] 

Thus noting that $(Xt) > on {f p > f G ; t £ [a, f G ]}, we have on this set, 

*{Xt) > \Xt-&\ 2 . 

Part (ii) of the lemma now follows on using the above inequality and (|3.19|) in (|3.18|) . 

In order to complete the proof we need to establish the statements in Steps 1, 3 and 4. 
Proof of Step 1. Let t be given. Let F t denote the event {\P S G (0,p), < s < t}. We have 

P(f - a > t) = P(f-a > t, f - a > t,f B - a > t) < P(F t ,f B -a > t). (3.20) 

Denote S u = inf{s : fi s > u}, where we recall that the infimum over an empty set is taken to 
be 00. Let k S (0, 1/16). Then 

P(F t ,f B -a>t,nt> Kt) < P{p t > Kt, l^s s G (0, p), f B ~ <? > t, < s < p t ) 

< P(% + H S + 3rs + P S £ (0, p), < s < Kt) 
= P(H S + P S £ (0,p), < s < Kt), (3.21) 
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where H is a standard Brownian motion (in particular, H s = Ms a for s < fit), Pt is a process 
that satisfies P s — P u < e, u < s, and H s = + H s + 3rs. On the event indicated in the last 
line of (|3.2ip . one has, for every integer k < Kt, that — Hk-i > —2. Hence the r.h.s. of 
(|3.2ip can be estimated by m\e~ m2Kt , for some positive constants m\ and 777-2, independent of 
t and k, e, and result, 

P(F t ,f B - a > t,fH > Kt) < m ie - m2Kt . (3.22) 



Next, on the event F t n {t# — a > t, m < Kt} we have f m s (l — a s — 5 S ) ds < nt, thus 

/* m 2 s {l - a s ) 2 ds <2 K t + 2 f m 2 s 5 2 ds < (2k + lfyl^i/j* e?)t < ~ A , 
Jo Jo 4 



where we assumed without loss that 2K+16ip 2 ^ip 2 oo^ 2 — 1/4- Consequently, f m s (l — a s )ds < 
t/2. Using m s > 1/2, we have Jq(1 + a s )ds > 2t — t = t, whence, letting c° be so large that 
c° - 8r > 8r, 

3r/^t + Pj < 3rt — Art = —rt, 

where we used fit < Kt < t. Using A3. 16H . on this event we have < W t < $0 + Mt — rt + Qt- 
Hence recalling that Q t < e and l/?o < ipl,oc£, denoting mo = "01,00 + 1, and letting 7 be the 
(i-dependent) stopping time 7 = inf{s : fi s > Kt}, we have, for t > r~ 1 mo£, 

P(F t ,f B - a > t,fit < Kt) < P(M t >rt- m e, fi t < Kt) < P(M tM > rt - m e) 

< 7 & IW^l < ,"»(">' (3.23) 
(rt — niQep (rt — moff) 

In the second inequality above we used the fact that fit < Kt implies 7 > t, and in the third 
we used Burkholder's inequality. In particular, 777,3 does not depend on t or k (which will allow 
us to use this estimate more efficiently in Step 3 below). Combining (I3.20p . (13.21R and ()3.23|) 
we obtain the statement in Step 1. 

Proof of Step 3. We begin by observing that, from (13.211) . 

P(Ft, f B -a>t,fi t > Kt) < P(H S + P s > 0, < s < Kt), (3.24) 

and since &o + P s < ra^e, this probability is bounded by 

p(e, Kt) := P(m e + H s + 3rs > 0, < s < Kt). 

The latter converges to zero as e — > (for fixed k and t). Let I be a modulus such that 
p(e,#(e)) < 0(e), and p(e) 1 / 4 > mo£. Taking t = r 1 (i?(e)) 1//4 and k = r7?(e) 3//4 , combining 
(1^231) and <^M), 

P(F t ,f B -a> r-^(e) 1/4 ) < + .iT^ilu = (1 + 16m 3 )0(e). 



(\m i/A ) 



Using the above estimate in (I3.20p . Step 3 follows. 
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Proof of Step 4. For a > 0, let r a and tq denote the first time [a, oo), and respectively, (— oo, 0], 
is hit by H. Since H is a Brownian motion (with drift 3r) starting from H(0^ ^ ^1,00^5 we 
have that P(r p - £ < tq) converges to zero as e —> 0. The proof is completed on noting that 

P(f P < f G ) < P{t p - £ < t ), 

which follows from (|3. 16j) . (13.170 . the relation H s = ^o+Ms a +3riJ,s a for all s < fioo = sup t>0 
and observing that on the set where o"o = sup{5 s : s < fioo} < oo we have that M t + 3rfi t = 
M CT0 + 3r/i CT0 , for t > cr . □ 



4. Analysis of the game with bounded controls 



The main result of this section, Theorem l4.lt implies Lemma [2. 31 Fix k, I such that mm{k, 1} > 
max{c°, dr, k n2 , l n2 }, where 712 is as in Theorem 13.11 Throughout this section, (A;, I) will be 
omitted from the notation. As in the previous section, only simple controls and strategies will 
be used. Recall that 



$(a, b, c, d;p, S) 
Fix 7 6 [0, 1) and write 



1 



(a - b)'S(a -b) - (c + d)(a + b) • p. 



At(p,S)= max min <P(a,b,c,d;p, S) — — Tr(S'), 



|o|=l,0<c<fc \b\=l,0<d<l 



A (p,S)= min max <P(a,b,c,d;p, S) — — Tr(iS), 



\b\=l,0<d<l \a\=l,0<c<k 



and consider the equations 



A+(Du,D 2 u) - h = iaG, 

u = g on dG. 

A-{Du,D 2 u)-h = inG, 



u = g 



on dG. 



(4.1) 



(4.2) 



We will write V 1 and, respectively, U 1 for the functions and introduced at the beginning 
of Section 

Theorem 4.1. For each 7 £ [0, 1), one has the following. 

(i) The function U 1 uniquely solves equation ()4.ip . 

(ii) The function V" 1 uniquely solves equation (|4.2p . 
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The proof of the theorem is based on a result on a finite time horizon, Proposition 14.11 in 
which we adopt a technique of 11]. Given a function u G C(G), xq G G, T > 0, and V G M°, 
Z G M°, let 

J 7 (a; ,T,u,y,Z) = S / /i(X 7 )ds + up^. 7 ) , (4.3) 

where X 7 and r 7 = t(xq,Y,Z) are as introduced in Section [3] with = xq, Y = (A,C) and 
Z = (B, D). 

Proposition 4.1. Let x G G, T G [0, do) and 7 G [0, 1). Let u G C(G). 
(%) If u is a subsolution of (14. ip f/ien 

tt(xo) < sup inf J 1 (xQ,T,u,a[Z], Z). (4.4) 

(m) If u is a supersolution of (14. ip i/ien 

n(xo) > sup inf J 1 (xo,T,u,a[Z], Z). (4.5) 

aer o Z6M» 

(mj If u is a subsolution of (14.21) i/ien 

u(x ) < inf sup J 7 (x ,y,r,n,/3[y]). (4.6) 

(w,) If u is a supersolution of (|4.2p i/ien 

u(x ) > inf sup J 7 (x ,T,u,y,/3[y]). (4.7) 



Before proving Proposition 14. 1\ we show how it implies the theorem. 

Proof of Theorem 14.11 We only prove (i) since the proof of (ii) is similar. We first argue 
that any solution of (|4.ip must equal C/ 7 , and then show that a solution exists. Let a solution 
u of (|4.ip be given. Fix xq G G and e > 0. Fix a G i^. such that 

C/ 7 (x ) < inf J*°{a,Z)+e. (4.8) 

By Proposition 14. l( ii) . 

u(x )> inf P(T,Z), (4.9) 

where we denote 

f(T,Z) = r(xo,T,u,a[Z],Z). 

For the rest of the proof we suppress 7 from the notation. Lemma 13.11 shows that there is 
mi < 00 such that, for every T G [0, 00), inf ZgM o j(T, Z) < m%. Letting M(T) = {Z G Mf : 
j(T, Z) < ci}, it follows from the lower bound on h that for some T < 00 that does not depend 
on Z, one has P{t > T) < e for all Z G M(T), where r = T x °(a,Z). Fix such a T. Given 
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Z £ M(T), let Z £ Mf be equal to Z on [0, T), and let it assume the constant value (a , 1) on 
[T, oo). Clearly j(T, Z) = j(T, Z). Also, by Lemma 13. 11 denoting f = r x °(a, Z), we have 

E[(t-T) + \t >T}< m 2 , 

for some constant m2 independent of e and T. By (|4.3p . (|4.8p . definition of the payoff and using 
the boundary condition u\qq = g, we have for some 7713 G (0, 00) 

U(x ) < J xo (a, Z) + e< J(x ,T, u,a[Z],Z) + m 3 {E[(f - T)+] + P(f > T)} + e. 

Using P{f > T) = P(t > T) yields U{x ) < j(T, Z) + m 4 e = j{T, Z) + m 4 e. Note that the 
infimum of j(T, Z) over is equal to that over M(T). Thus using (14. 9p and sending e — > 
proves that J7(xo) < ^(^o)- 

To obtain the reverse inequality fix xq £ G. From Lemma 13.11 there exists 1715 < 00 and 
Z\ € Mf such that, for every a, 

J X0 (a,Z!) <m 5 . 

Denote N{a) = {Z : J x °(a,Z) < m§}. Clearly for each a, the infimum of J x °(a,Z) over all 
Z G Mj° is equal to that over Z G ^(a). Hence 

[/(x )> inf J^(a,Z), aer°. 

ZeN(a) 

Using the positive lower bound on h as before, it follows that there exists a function r : [0, 00) — ► 
[0,oo) with lmiT^oo r(T) = 0, such that for every a and Z £ N(a) we have P(r xo (a, Z) > 
T) < r(T). Therefore for some tuq £ (0, 00) 

J X0 (a,Z) > J(x ,T,u,a[Z],Z) -m 6 r(T), a £ lf k , Z £ N(a). 

In conjunction with Proposition I4.1( i). this shows that U(xq) > u{x$) — m,Qr(T). Since T is 
arbitrary, we obtain U{xq) > u(xq). 

Finally we argue existence of solutions to equation (|4.ip . Let us write (|4.ip 7 for equation 
(|4.ip with a specific 7. For 7 £ (0,1), existence of solutions to (|4.ip 7 follows from Theorem 
1.1 of [4|. To handle the case 7 = 0, we will use the fact that any uniform limit, as 7 — > 0, 
of solutions to (|4.ip 7 is a solution to (|4.1|) n. This fact follows by a standard argument, that 
we omit. Now, since for 7 £ (0, 1) we have existence, the uniqueness statement established 
above shows that C/ 7 solves (|4.ip 7 . From Theorem 13. II we have that the family {C/ 7 ,7 £ (0, 1)} 
is equicontinuous, and thus a uniform limit of solutions, and in turn a solution to (|4.ip n. 
exists. □ 

In rest of this section we prove Proposition 14.11 

Let G n be a sequence of domains compactly contained in G and increasing to G. Let J 7 be 
defined as J 7 of (14. 3p . with t 7 = r 7 (xo, Y, Z) replaced by r 7 = r 7 (xo, Y, Z), where 

r 7 = inf{t : X? £ 8G n }. 
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Lemma 4.1. For every n, and 7 G [0, 1) Proposition \4-l\ holds with J 7 replaced by J 7 . 



Proof. We follow the proof of ll|, Lemma 2.3 and Theorem 2.1]. Assume without loss that 



Go CC G\ CC G2 CC G. We will prove the lemma for n = 0. Since the claim is trivial if 
xq Go, assume xo £ Go- In this proof only, write r for Tq, the exit time of X 7 from Go- Fix 
7 > 7, let f = and <r = r A f. For e > consider the sup convolution 



u s (x) = sup I u(£) k x e G 2 , 



where, in the above equation only, u is extended to R m by setting u = outside G. It is easy to 
see that there exists £0 such that the supremum is attained inside G for all (x, e) £ G2 x (0, Eq). 
The standard mollification id ■ G\ —* R of u e : G2 — > R is well defined, provided that <5 is 
sufficiently small. The result [111 . Lemma 2.3] for the smooth function u s £ and the argument in 
the proof of [ill . Theorem 2.1] show 



r /' TAct - x - 1 

(x )<sup inf £ / /i(X 7 )ds + <(X^ A(T ) +p(e,5, 7 , 7 ), 



al. , ... 




where lim e _^o lim^,^ 7 lim^^o p(. £ > ^ 7j 7) = 0- We remark here that Lemma 2.3 of [ll|] is written 
for the case where u is a subsolution of a PDE of the form (|4.ip on all of R m and T A <r is 
replaced by T, however the proof with u and T A a as in the current setting can be carried out 
in exactly the same way. Since Go is compactly contained in G±, we have that for every 6 > 

sup sup IP (\T /\o -T /\t\+ sup \X] - 
aer°zeM° { \ o<s<t 

converges to as 7 — > 7. Moreover, — ► u £ as (5 — > and n e — ► n as e — > 0, where in both 
cases, the convergence is uniform on Go (see ibid.). Hence the result follows on taking 5 — > 0, 
then 7 — > 7, and finally e — > 0. □ 

Proof of Proposition 14.11 The main argument is similar to that of Theorem 12.11 and so we 
omit some of the details. We will prove only item (iv) of the proposition, since the other items 
can be proved in a similar way. 

Fix x and T. Let ttbea supersolution of (|4.2j> . Let n be large enough so that dist((9G n , dG) < 
pi. Write H{Y,I3) for JJ{x,T,u,Y, 0[Y]) and p{Y,j3) for J 7 (x, T, u, Y, fl[Y]). Below we will 
keep 7 in the notation only if there is scope for confusion. By Lemma 14.11 u(x) > v n := 
infg supy j n (Y, (3), for every n. We need to show n(x) > v := inf/j supy j(Y, 

Fix e > 0. Let /3 n be such that 

supi n (y,/3 n ) < v n + e, (4.10) 

and let rf(Y) = r 7 (x, Y, (3 n [Y]), Y £ Mf. Let (3 n be constructed from /3 n as in the proof of 
Theorem l2.lt where in particular, /3 n [Y] and /3^[Y] differ only on [rf , 00), by which j n (Y, f3 n ) = 
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jn{Y,/3 n )- Choose Y n such that 

v < sup j(Y, p n ) < j(Y n , n ) + e, 

Y 

and set r 2 n (y) = r 7 (x, Y, n [Y]). Then 

v - v n - 2e < j(Y n ,]3 n ) - j n (Y n , P„) =: S n . 

Denote X n = X x (Y n , (3 n ). Using Lemma I3T2"! 

< r 2 n - 7? < e and |X n (r 1 n AT) — X n (r 2 n A T)\ < e, 

with probability tending to 1 as n — > oo. It now follows from the definition of J n and J 
(cf. (jO]l ) that lim sup n 5 n < /f(^) for some modulus p. Since e is arbitrary, this proves the 
result. □ 



5. Remarks on optimal solution trajectories 



Recall from (CD]) that 



$(a,b,c,d;p,S) = -~{a-b)'S(a-b) - (c + d)(a + b) ■ p, (5.1) 

and denote 

A + (p,S)= sup inf $(a,b,c,d;p,S), (5.2) 

|b|=l,0<(i<oo |o|=l,0<c<oo 

A~(p,S)= inf sup <P(a,b,c,d;p,S) (5-3) 

|o|=l,0<c<oo \b\=l, 0<d<OD 

(compare with (|2.2p - (|2.3p ). The following proposition is a consequence of Lemma 12.11 The 
result states that for the SDG of Section Tl.21 the associated Isaacs condition, A + = A~, holds. 
Although we do not make use of this observation in our proofs (except in Proposition 15.21 
below), such a condition is often invoked in studies of games in showing that the game has a 



value (cf. [6(], [111]). 



Proposition 5.1. For p G R m , p / and S G S(m), A + (p,S) = A(p, S) and A~(p,S) 
A(p,S). 



Proof. We will only show A~ = A (the proof of A + = A being similar). Fix p, S, and omit 
them from the notation. Write TCk for {(a, c) G 7i : c < k} and 4>(y,z) for $(a, b, c, d), where 
y = (a, c), z = (6, d). Given 6 > let k be such that A~ > inf^g^ sup^ gW 4>(y, z) — 5. Then 



A > inf sup 4>(y, z) - 5 = A H 



Thus by Lemma |2.1[ A > A. 
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Next, let (j)(y) = sup 2gW <fi(y, z). Fix 5 G (0, oo), let ya = (p, <5 1 ), where p = p/\p\, and let 
-2<5 = (bs, ds) G be such that ^(ya) < 0(y<s, z<s) + 5- Then 

A' < 4>{y s ) <-\{P- bs)'S(p - b s ) - (5- 1 + d s )(p + b s ) ■ p + S < ~(p- b s )'S(p - b s ) + 5. 

Note that bs must converge to —p or else the middle inequality above will say A~ = — oo, 
contradicting the bound A" > A. Letting 5 — > 0, we now have from the third inequality that 
A~ < A. The result follows. □ 

In [j^] the authors raise questions and state partial results about the form of the limit 
trajectory under optimal play of the Tug-of-War game, as the step size approaches zero (see 
Section 7 therein). It is natural to ask, similarly, whether one can characterize (near) optimal 
trajectories for the SDG studied in the current paper. Our aim in the following discussion is 
only to present the main idea for such an analysis; the precise argument is work in progress. 

Let V be as given in (jl.lOp . Given 5 > 0, let (3 s G r be a <5-optimal strategy, and let Y s G M 
be a (5-optimal control for playing against (3 s . One may ask whether the law of the process X s , 
under (f3 s ,Y s ), converges to a limit law as 5 — > 0; whether this limit law is the same for any 
choice of such (/3 s ,Y 5 ) pairs; and finally, whether an explicit characterization of this limit law 
can be provided. A somewhat less ambitious goal, that we will discuss here, is to ask whether 
there exists some choice of <5-optimal ((3 s , Y 5 ) such that the limit law for the corresponding X s 
exists and can be characterized in a simple manner. 

The sketch that we present below aims at showing that this question can be reduced to the 
understanding of much simpler objects, namely A + and A~ of (|5.2p . (|5.3p . which represent the 
values of a 'static game'. 

A basic assumption will be that V is a C 2 (G) function, with nonvanishing gradient in G. 



a C 2 (G) 
7 of fl. 



A similar assumption is made in Section 7 of [9|. Under this assumption, by an application of 
Ito's formula to V(X) and noting that g(X T ) = V(X T ), the cost function J in (II. 9h can be 
written as 

,T iP(X s ,Y s ,Z s )ds , (5.4) 



o 



J(x,Y,Z) = V{x)-E 
where 

ip(x, y, z) = -h(x) + #(a, b, c, d; DV(x), D 2 V(x)), y = (a, c), z = (b, d). 

Hence 



inf sup E 



o 



il){X s ,Y s ,Z s )ds 



0, (5.5) 



where Z = (3\Y\. Since V satisfies (II. ip in the classical sense, and since by Proposition 15.11 
A + = A, we have 

sup inf tj)(x,y,z) =0, x G G. (5-6) 
z£ T-cy&H 

Equation (|5.6p will be used to construct the proposed (conjectured) nearly optimal strategy 
and control for V. Note that this equation is based on the fact that V solves the equation 
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A + — h = 0. This is in contrast to the analysis of Section 01 where the lower value function 
Vki was characterized as the unique solution of equation A^ — h = 0. Thus the validity of the 
Isaacs condition is crucially used in this argument. 

Given 5 > 0, let z s (x) = (b s (x) , d s (x)) be such that 

inf iP(x,y,z 5 (x)) E [-5,0], x E G. (5.7) 
yen 

Note that we must have b s (x) = —p(x), where p(x) = DV(x) and p(x) = , for otherwise 
the infimum above is — oo. For such a choice of 6, the last term of (|5.ip is given by 

- (c + d)(a + b) ■ p = (c + - o -P)\p\ > 0- (5.8) 

By positivity of this term we can (and will) assume that d s (x) > d^ 1 so that d s (x) — > oo as 
<5 -» 0. 

Next, let ^(a;) = (a s (x), c s (x)) be such that 

ip(x, y 5 (x),z s (x)) £ [-5, 5], xeG. (5.9) 
Note by that we can take c 5 (x) = 0. 

To define (3 & , let K = (A, C) be given, and consider the equation 

dX = (A - b 5 {X))dW + (C + d s (X))(A + b s (X))ds. 

Assuming that this equation has a unique solution (which is a question on whether 'nice' 
selection z s exists), this defines Z s = (b s (X), d s (X)), and hence the strategy [3 s . 

We claim that (3 s is c<5-optimal, for some constant c E (0, oo), namely, J(x,Y, (3 S [Y]) < 
V(x) + c5 for all 1" E M. We do not prove the claim here, however note that for every s, we 
have 4>(X s ,Y s ,z s (X s )) > -6, and thus E[ - J r t/j(X s , Y„ /3 s [Y} s )ds] < 5Et. Thus, if one can 
show that Et is bounded uniformly in Y and 6, the claim will follow. 

Next, consider the control Y s , determined by solving the equation 

dX = P s (X)dW + Q s (X)ds, (5.10) 

where 

P s (x) = a s (x) - b s (x), Q s (x) = (c s (x) + d 5 (x)){a s (x) + b s (x)), 

and setting Y s = (a s (X), c s {X)). Once more, additional properties of the selection y s ,z s are 
needed in order to ensure existence of solutions. 

We next claim that Y s is c<5-optimal for playing against /3 5 for some c E (0, oo), namely, 
V(x)-c5 < J(x,Y s ,p 6 [Y 6 }) < V(x) + c5. Since ip(X s , Y s , p[Y] s ) < 5 for all s, the proof of the 
above claim, that is omitted here, will be similar to the one for the c8 optimality of (3 s . 
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Thus one can interpret X s that is obtained by solving (|5.10p as a c5-optimal trajectory 
associated with the SDG. One would like to take 5 — > and characterize the limits of X s . This 
question regards the limit of the functions y s and z s of (|5.T|) and (|5.9p . 

In what follows, we will suppress x from notation in y s (x), z s (x) etc. We denote D 2 V(x) by 

S. 

Proposition 5.2. Let q = j^(Sp — (p, Sp)p) (note thatp-q = 0). Then on a sequence 5 n — ► ; 
pS n _^ 2p and Q Sn — * q pointwise. 

From (|5.1U|) and (|5.11|) . this leads to the following form for the limit trajectory, 

dX = 2p(X)dW + q(X)ds. 

This is precisely the form of the limit trajectory conjectured for the Tug-of-War game, as the 
step size approaches 0, in 

Proof. Following the proof of Lemma 12. II we see that as 5 — > 0, a s — > p. In particular 

a s -b s = a s +p^2p, as 5^0. (5.11) 
We now consider the limiting behavior of 

Q s = (c 8 + d 5 )(a s + b s ) = d s (a s - p). 

Write a 5 as 

where \rjs \ = 1, rjg ■ p = and 9$ £ [0,1]. For notational convenience we will write (1 — Of) 1 / 2 
as $5. Any choice of a 5 corresponds to a choice of (9s,i]s) £ 6, where 6 = [0, 1] x and 
S™" 1 = {«e S™- 1 : « • p = 0}. For (0, 77) £ 6, write 

*s(P, V) = ~[{l + e)P + er l ]'S[(l + §)p + 9r,]+d s (l-§)\p\, = (1-0 2 ) 1/2 - 

It is easy to see that $,5 (#5, 77,5) = $(ys,z$). We will assume, without loss of generality, that 
(0 s ,vs) G axgmin (fl>n ) e6 {*i(e,»7)}. Define for (0,??) £ S, *(0,r?) = %0t/Sti + (l + §)rfSp. Note 
that 

= -^-^-p'Sp-9^(9, v ) + - (9)|p|. 

Let, for £ [0, 1], rf(9) £ argmax^m-i ^{9, if). We claim that 77* (0) — ► q as — > 0. Indeed, 

note that q is the unique point in at which ij'Sp is maximized. Now suppose that as 

— ► 0, r/*(0) — > 17* (along some subsequence). Since \P(0, r?*(0)) > ^(9,q), we have on taking 
limit as — ► 0, (j]*)'Sp > q'Sp. The claim follows. 

Let for £ [0, 1], H{9) = f(r/*(0),0) and define 

G s {9) = - ^ + n 9)2 p'Sp - 9H(9) + d 5 (l - 9)\p\. 
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Let 8g E argmin0 g r Oil i Gg{9). Since (dg,f]*(9g)) G argmin™ ^^{^(fl, 77)}, we can take, without 
loss of generality, (9g,rjg) = {9* 5 ,n*{9* 5 )). Thus 

a* = (l-(^) 2 ) 1/2 P + W(^). 

We next claim that 

d s (l - 9g) -> as 5 -> 0. (5.12) 
To see this, note that G 5 (6»|) < Gg(0) = -2pSp. Thus 

-2p'5p > lim sup Gg{9*g) = -2p'Sp+\p\ limsup ^(1 - 9* s ). 
8^0 8^0 

This shows (|5.12p . Next, suppressing 5, by the Lagrange multipliers theorem, D<f>{a) = Xa for 
some A 6 8, where <f)(a) = <P(a, —p, 0, d;p, S). Thus —S(a + p) — dp = Xa, and 

u —\ , , a S(a + p) 
d(a — p) = da + At— r H j— : 

bl \P\ 
d(a ■ p — I) a ap'S{a + p) S{a + p) 
a ■ p \p\ a ■ p \p\ 

Since a — > p, the first term converges to zero using (|5.12p . whereas the sum of the last two 
terms converges to q. □ 
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